Figure 1. N-linked of 



larides for glycosylation 



Peptide sequence (SEQ ID NO: 5):
MPFVNKQFNY KDPVNGVDIA YIKIPNAGQM QPVKAFKIHN KIWVIPERDT FTNPEEGDLN
PPPEAKQVPV SYYDSTYLST DNEKDNYLKG VTKLFERIYS TDLGRMLLTS IVRGIPFWGG
STIDTELKVI DTNCINVIQP DGSYRSEELN LVIIGPSADI IQFECKSFGH EVLNLTRNGY
GSTQYIRFSP DFTFGFEESL EVDTNPLLGA GKFATDPAVT LAHELIHAGH RLYGIAINPN
RVFKVNTNAY YEMSGLEVSF EELRTFGGHD AKFIDSLQEN EFRLYYYNKF KDIASTLNKA
KSIVGTTASL QYMKNVFKEK YLLSEDTSGK FSVDKLKFDK LYKMLTEIYT EDNFVKFFKV
LNRKTYLNFD KAVFKINIVP KVNYTIYDGF NLRNTNLAAN FNGQNTEINN MNFTKLKNFT
GLFEFYKLLC VRGIITSKTK SLDKGYNKAL NDLCIKVNNW DLFFSPSEDN FTNDLNKGEE
ITSDTNIEAA EENISLDLIQ QYYLTFNFDN EPENISIENL SSDIIGQLEL MPNIERFPNG
KKYELDKYTM FHYLRAQEFE HGKSRIALTN SVNEALLNPS RVYTFFSSDY VKKVNKATEA
AMFLGWVEQL VYDFTDETSE VSTTDKIADI TIIIPYIGPA LNIGNMLYKD DFVGALIFSG
AVILLEFIPE IAIPVLGTFA LVSYIANKVL TVQTIDNALS KRNEKWDEVY KYIVTNWLAK
VNTQIDLIRK KMKEALENQA EATKAIINYQ YNQYTEEEKN NINFNIDDLS SKLNESINKA
MININKFLNQ CSVSYLMNSM IPYGVKRLED FDASLKDALL KYIYDNRGTL IGQVDRLKDK
VNNTLSTDIP FQLSKYVDNQ RLLSTFTEYI KNIINTSILN LRYESNHLID LSRYASKINI
GSKVNFDPID KNQIQLFNLE SSKIEVILKN AIVYNSMYEN FSTSFWIRIP KYFNSISLNN
EYTIINCMEN NSGWKVSLNY GEIIWTLQDT QEIKQRVVFK YSQMINISDY INRWIFVTIT
NNRLNNSKIY INGRLIDQKP ISNLGNIHAS NNIMFKLDGC RDTHRYIWIK YFNLFDKELN
EKEIKDLYDN QSNSGILKDF WGDYLQYDKP YYMLNLYDPN KYVDVNNVGI RGYMYLKGPR
GSVMTTNIYL NSSLYRGTKF IIKKYASGNK DNIVRNNDRV YINVVVKNKE YRLATNASQA
GVEKILSALE IPDVGNLSQV VVMKSKNDQG ITNKCKMNLQ DNNGNDIGFI GFHQFNNIAK
LVASNWYNRQ IERSSRTLGC SWEFIPVDDG WGERPL



Peptides containing the motif 'N-X-S/T/C (X not P)' (underlined):

position    peptide    SEQ ID NO:

167-177     SFGHEVLNLTR  6
382-393     VNYTIYDGFNLR  7
394-415     NTNLAANFNGQNTEINNMNFTK  8
418-427     NFTGLFEFYK  9
457-477     VNNWDLFFSPSEDNFTNDLNK  10
478-536     GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER  11
773-779     LNESINK  12
787-806     FLNQCSVSYLMNSMIPYGVK  13
841-855     VNNTLSTDIPFQLSK  14
872-882     NIINTSILNLR  15
930-948     NAIVYNSMYENFSTSFWIR  16
952-975     YFNSISLNNEYTIINCMENNSGWK  17
1001-1013   YSQMINISDYINR  18
1024-1028   LNNSK  19
1086-1098   DLYDNQSNSGILK  20
1141-1156   GSVMTTNIYLNSSLYR  21
1193-1204   LATNASQAGVEK  22
1205-1224   ILSALEIPDVGNLSQVVVMK  23
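The motif rule used for the table above can be illustrated with a short scan. This is a minimal sketch, not the procedure used to prepare the figure: the function name `find_sequons` and the `offset` parameter are hypothetical, and the lookahead pattern is simply one way to express "N, then any residue except P, then S, T or C".

```python
import re

# Zero-width lookahead so that overlapping motifs are all reported.
# Assumption: "X not P" is taken to mean any single character other than P.
SEQUON = re.compile(r"(?=N[^P][STC])")

def find_sequons(seq, offset=1):
    """Return (position of N, three-residue motif) for every N-X-S/T/C match.

    `offset` is the 1-based position of the first residue of `seq`
    in the full-length sequence (hypothetical helper parameter).
    """
    seq = seq.upper()
    return [(m.start() + offset, seq[m.start():m.start() + 3])
            for m in SEQUON.finditer(seq)]

# Peptide listed at positions 167-177 in the table
print(find_sequons("SFGHEVLNLTR", offset=167))  # [(174, 'NLT')]
```

A peptide such as `NPS` returns no match, because the residue after N is P.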



Figure 1 (continued).

Peptide sequence (SEQ ID NO: 24):

KTKSLDKGYN KALNDLCIKV NNWDLFFSPS EDNFTNDLNK GEEITSDTNI EAAEENISLD 

LIQQYYLTFN FDNEPENISI ENLSSDIIGQ LELMPNIERF PNGKKYELDK YTMFHYLRAQ 

EFEHGKSRIA LTNSVNEALL NPSRVYTFFS SDYVKKVNKA TEAAMFLGWV EQLVYDFTDE 

TSEVSTTDKI ADITIIIPYI GPALNIGNML YKDDFVGALI FSGAVILLEF IPEIAIPVLG 

TFALVSYIAN KVLTVQTIDN ALSKRNEKWD EVYKYIVTNW LAKVNTQIDL IRKKMKEALE 

NQAEATKAII NYQYNQYTEE EKNNINFNID DLSSKLNESI NKAMININKF LNQCSVSYLM 

NSMIPYGVKR LEDFDASLKD ALLKYIYDNR GTLIGQVDRL KDKVNNTLST DIPFQLSKYV 

DNQRLLSTFT EYIKNIINTS ILNLRYESNH LIDLSRYASK INIGSKVNFD PIDKNQIQLF 

NLESSKIEVI LKNAIVYNSM YENFSTSFWI RIPKYFNSIS LNNEYTIINC MENNSGWKVS 

LNYGEIIWTL QDTQEIKQRV VFKYSQMINI SDYINRWIFV TITNNRLNNS KIYINGRLID 

QKPISNLGNI HASNNIMFKL DGCRDTHRYI WIKYFNLFDK ELNEKEIKDL YDNQSNSGIL 

KDFWGDYLQY DKPYYMLNLY DPNKYVDVNN VGIRGYMYLK GPRGSVMTTN IYLNSSLYRG

TKFIIKKYAS GNKDNIVRNN DRVYINVVVK NKEYRLATNA SQAGVEKILS ALEIPDVGNL 

SQVVVMKSKN DQGITNKCKM NLQDNNGNDI GFIGFHQFNN IAKLVASNWY NRQIERSSRT
LGCSWEFIPV DDGWGERPL 



Peptides containing the motif 'N-X-S/T/C (X not P)':



position    peptide    SEQ ID NO:

20-40       VNNWDLFFSPSEDNFTNDLNK  25
41-99       GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER  26
336-342     LNESINK  27
350-369     FLNQCSVSYLMNSMIPYGVK  28
404-418     VNNTLSTDIPFQLSK  29
435-445     NIINTSILNLR  30
493-511     NAIVYNSMYENFSTSFWIR  31
515-538     YFNSISLNNEYTIINCMENNSGWK  32
564-576     YSQMINISDYINR  33
587-591     LNNSK  34
649-661     DLYDNQSNSGILK  35
704-719     GSVMTTNIYLNSSLYR  36
756-767     LATNASQAGVEK  37
768-787     ILSALEIPDVGNLSQVVVMK  38



Figure 2. 

Peptide sequence (SEQ ID NO: 39): 
MPFVNKQFNY KDPVNGVDIA YIKIPNAGQM QPVKAFKIHN KIWVIPERDT FTNPEEGDLN
PPPEAKQVPV SYYDSTYLST DNEKDNYLKG VTKLFERIYS TDLGRMLLTS IVRGIPFWGG
STIDTELKVI DTNCINVIQP DGSYRSEELN LVIIGPSADI IQFECKSFGH EVLNLTRNGY
GSTQYIRFSP DFTFGFEESL EVDTNPLLGA GKFATDPAVT LAHELIHAGH RLYGIAINPN
RVFKVNTNAY YEMSGLEVSF EELRTFGGHD AKFIDSLQEN EFRLYYYNKF KDIASTLNKA
KSIVGTTASL QYMKNVFKEK YLLSEDTSGK FSVDKLKFDK LYKMLTEIYT EDNFVKFFKV
LNRKTYLNFD KAVFKINIVP KVNYTIYDGF NLRNTNLAAN FNGQNTEINN MNFTKLKNFT
GLFEFYKLLC VRGIITSKTK SLDKGYNKAL NDLCIKVNNW DLFFSPSEDN FTNDLNKGEE
ITSDTNIEAA EENISLDLIQ QYYLTFNFDN EPENISIENL SSDIIGQLEL MPNIERFPNG
KKYELDKYTM FHYLRAQEFE HGKSRIALTN SVNEALLNPS RVYTFFSSDY VKKVNKATEA
AMFLGWVEQL VYDFTDETSE VSTTDKIADI TIIIPYIGPA LNIGNMLYKD DFVGALIFSG
AVILLEFIPE IAIPVLGTFA LVSYIANKVL TVQTIDNALS KRNEKWDEVY KYIVTNWLAK
VNTQIDLIRK KMKEALENQA EATKAIINYQ YNQYTEEEKN NINFNIDDLS SKLNESINKA
MININKFLNQ CSVSYLMNSM IPYGVKRLED FDASLKDALL KYIYDNRGTL IGQVDRLKDK
VNNTLSTDIP FQLSKYVDNQ RLLSTFTEYI KNIINTSILN LRYESNHLID LSRYASKINI
GSKVNFDPID KNQIQLFNLE SSKIEVILKN AIVYNSMYEN FSTSFWIRIP KYFNSISLNN
EYTIINCMEN NSGWKVSLNY GEIIWTLQDT QEIKQRVVFK YSQMINISDY INRWIFVTIT
NNRLNNSKIY INGRLIDQKP ISNLGNIHAS NNIMFKLDGC RDTHRYIWIK YFNLFDKELN
EKEIKDLYDN QSNSGILKDF WGDYLQYDKP YYMLNLYDPN KYVDVNNVGI RGYMYLKGPR
GSVMTTNIYL NSSLYRGTKF IIKKYASGNK DNIVRNNDRV YINVVVKNKE YRLATNASQA
GVEKILSALE IPDVGNLSQV VVMKSKNDQG ITNKCKMNLQ DNNGNDIGFI GFHQFNNIAK
LVASNWYNRQ IERSSRTLGC SWEFIPVDDG WGERPL



Peptides containing S or T (underlined): 



position    peptide    SEQ ID NO:

49-66       DTFTNPEEGDLNPPPEAK  40
67-84       QVPVSYYDSTYLSTDNEK  41
90-93       GVTK  42
98-105      IYSTDLGR  43
106-113     MLLTSIVR  44
114-128     GIPFWGGSTIDTELK  45
129-145     VIDTNCINVIQPDGSYR  46
146-166     SEELNLVIIGPSADIIQFECK  47
167-177     SFGHEVLNLTR  48
178-187     NGYGSTQYIR  49
188-212     FSPDFTFGFEESLEVDTNPLLGAGK  50
213-231     FATDPAVTLAHELIHAGHR  51
245-264     VNTNAYYEMSGLEVSFEELR  52
265-272     TFGGHDAK  53
273-283     FIDSLQENEFR  54
292-299     DIASTLNK  55
302-314     SIVGTTASLQYMK  56
321-330     YLLSEDTSGK  57
331-335     FSVDK  58
344-356     MLTEIYTEDNFVK  59
365-371     TYLNFDK
382-393     VNYTIYDGFNLR
394-415     NTNLAANFNGQNTEINNMNFTK
418-427     NFTGLFEFYK  63
433-438     GIITSK
439-440     TK
441-444     SLDK  66



Figure 2 (continued).

position    peptide    SEQ ID NO:

478-536     GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER
548-555     YTMFHYLR
564-565     SR
566-581     IALTNSVNEALLNPSR
582-592     VYTFFSSDYVK
597-626     ATEAAMFLGWVEQLVYDFTDETSEVSTTDK
627-649     IADITIIIPYIGPALNIGNMLYK
650-688     DDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANK
689-701     VLTVQTIDNALSK
712-720     YIVTNWLAK
721-729     VNTQIDLIR
734-744     EALENQAEATK
745-759     AIINYQYNQYTEEEK
760-772     NNINFNIDDLSSK
773-779     LNESINK
787-806     FLNQCSVSYLMNSMIPYGVK
808-816     LEDFDASLK
828-836     GTLIGQVDR
841-855     VNNTLSTDIPFQLSK
862-871     LLSTFTEYIK
872-882     NIINTSILNLR
883-893     YESNHLIDLSR
894-897     YASK
898-903     INIGSK
912-923     NQIQLFNLESSK
930-948     NAIVYNSMYENFSTSFWIR
952-975     YFNSISLNNEYTIINCMENNSGWK
976-994     VSLNYGEIIWTLQDTQEIK
1001-1013   YSQMINISDYINR
1014-1023   WIFVTITNNR
1024-1028   LNNSK
1035-1056   LIDQKPISNLGNIHASNNIMFK
1062-1065   DTHR
1086-1098   DLYDNQSNSGILK
1141-1156   GSVMTTNIYLNSSLYR
1157-1159   GTK
1165-1170   YASGNK
1193-1204   LATNASQAGVEK
1205-1224   ILSALEIPDVGNLSQVVVMK
1225-1226   SK
1227-1234   NDQGITNK
1261-1269   LVASNWYNR
1274-1276   SSR
1277-1296   TLGCSWEFIPVDDGWGERPL
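The peptide boundaries in the Figure 2 table can be reproduced with a simple in-silico digest. The source does not name a protease; the sketch below assumes trypsin-like cleavage (after K or R, but not when the next residue is P), which is consistent with the listed positions but remains an assumption. The function names are hypothetical.

```python
import re

def tryptic_peptides(seq):
    """Yield (start, end, peptide) with 1-based inclusive positions.

    Assumption: cleavage after K or R unless the next residue is P.
    """
    start = 0
    for m in re.finditer(r"[KR](?!P)", seq):
        yield start + 1, m.end(), seq[start:m.end()]
        start = m.end()
    if start < len(seq):
        # C-terminal remainder that does not end in K or R
        yield start + 1, len(seq), seq[start:]

def peptides_with_ser_thr(seq):
    """Keep only peptides that contain at least one S or T."""
    return [(s, e, p) for s, e, p in tryptic_peptides(seq)
            if "S" in p or "T" in p]

# Residues 1-66 of the Figure 2 sequence
n_term = ("MPFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHN"
          "KIWVIPERDTFTNPEEGDLNPPPEAK")
print(peptides_with_ser_thr(n_term))  # [(49, 66, 'DTFTNPEEGDLNPPPEAK')]
```

Under this assumption the first peptide returned is the first row of the table (positions 49-66); the earlier fragments contain neither S nor T and are filtered out.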



